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(54) DNA POLYMERASE GENE DERIVED FROM ULTRATHERMOPHIUC PROTOBACTERIUM AND ITS USE 

(5 7) Abstract 

PURPOSE: To obtain a DNA polymerase having heat resistance, excellent 
in thermal stability and stability to organic solvents, also excellent in the 
accuracy in its nucleic acid incorporation, thus useful for DNA sequence 
determination and polymerase chain reaction. . 

CONSTITUTION: This DNA polymerase is derived from ultrathermophilic 
protobacterium KODI, having a molecular weight of about 86-92Kda and 
presenting high thermally stable DNA polymerase activity. This DNA 
polymerase is obtained by culturing recombinant host cells transformed by 
using a DNA recombinant manifestation vector (e.g. pET-pol) which is 
prepared by introducing an isolated DNA coding thjs DNA polymerase into 
e.g. a pET-8c- derived vector. The construction diagram for the DNA 
recombinant manifestation vector is illustrated. 
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fc*. 

[§S#5 2] ^fI^8 6-9 2Kdati5:i 

z&mt? zmtm. i ei©dna* u * 7— t*. 
7— t*. 

[»#JS5] M».^ISMKOD 1 4*0DNA/|'U 
^7— fe'£3- KTS#fl^nfcDNA„ 
[I8#JS6] E?'J#^2tCfgig2*l57 = /®'62?U£ 

5 KE«an*JM*'a*ifcDN A. 
SnSJfliSttifcDNA. 

[Si#Ji8] »jRJ«5fc:E*£*lfcDNAfc'<**- 
fcjf A L fc D N AH&& tf 9 — . 

T£5 Zl <h fcWfttrsaM&H 8 fBISOD N Affl&Afg 
^/ 7 ^_ (pET-po 1) . 
[»*aiO] K#:«8fcE*anSDNA»«A.5B 

£. 

1 2 ] tt*Ji 1 0 (cEttdti^litt^.'S^n 

£££4*«if*iB#*ft»]R*KOD 1 E&*GDDNA^ 
U * 5— feOSSijgffio 

1 3 ] warn i o fcE«snsa»Ais±« 

IS£**U (a) K»SIAg3Eiffla@Sr*ae)fc^. 5£5** 

u mmmm^i^mmv. (to m^^«±iaBflS**o 

e » € -& Ii Sr^if £ t * 4$ ft t T * iB» 
K!!&ElKODl4*DNA#W7-f$«St^ 

ft. 

1 3f2«©jg#?&&I»MKOD 1 ft*DNA:}5U^7- 
L<«9 0*C&±T**-l£fc*«£T*lll:*"*l 5E 



[0 0 0 1] 

D 1 ft*©DNA#'J -tf*SJ:tfKjKy*7--e£ 

=i - i*:r sae^ft 6 zfiz&mfcT&mmrzD n a # 

[0 0 0 2] 

©DNA# 'J 7— f*3«t^^SttifflBtcg5^-r-5 7 t 

wswfcan-cv**. ££«ifi. jHu^7— ffa«s* 

(PCR) «0«nS«H«ffl^«|ll'ft^.DNASffilC«- 

ffl*»3»ttDNAJt?'j / 5— ifccwr *a 

tlTl/i*. PCRR«"rffl^6n*BJRttDNA#»J^ 
5— fetLTli, ±ttTl7-VX • -9— ^7^ 7X(T 
hermus thermophi lus) fi3f*:tf>DN A# U * v— fe*(Tth7i* 
■J*7— fe*> +>— • 7^7^AX(Thermus aqu 
aticus) E&3f5£>DNA#'J ■ fe(Taq#'J ■ fe) 

[0 0 0 3] 

[^W^/Sftbcfcpt-T-SiSS] l,*vLfcj&«&, ft**] 
£ttTV>-3»f&"fDNA#U*7-- tflttt. 

C*e"F. BHBS58L/T^*. mm<DWD&fr<bl& 

iBT5l|j|a»ittDNA#iJ/7- ^MSftT 
nfc. ^fcbfnin • 7 U (Pyrococcus furi 
osus) a*OB*RttDNA#U/ 7— fc*(Pfu#U*7- 
■fe*. WO92/09689, #5fl¥5-328969^£tf8) , t-t^y 
AX - U h 7 U 7. (Thermococcus 1 i toral is) 663t5<£>I^ 
ttDNAJpU — t? (Tli#U^7— fe\ <t#M¥6-7160 

■f&sg) affeasntus. i^Lte^, cms® 

»^titDNA#'J/7- fe'tt, ^S-O3j0}i^CD^<7) 
iESHtttT a gDNA#iJ^7-Wh eDNA^'J 
■ fetcJt^gtlT^^^. -S£fcfc©T?"ifc<i*r« 
a»J»ttDNA#>J >" 5-l?lJ«aiftTl»fc. 
[0 0 0 4] 

^HJJcSUaL-fc. r3Ctto-6*5BW»4a»JI!a6Jll*KOD 

[0 0 0 5] Sfc^-Jfl'iSJf^MKKOD 1 S*© 
DNA#'J;*7— tfSrll— HT-5*f8^nfcDN AT* 

[0 0 0 6] Z $\ZttWtMR1MhWmYLO\> 1 fi3t5 
ODNA#U/5— t?*fe3— l<"r*l|lll*nfcDNAt 

^^7^-'Cjf ALitDNAffl^A^^i'^-TS-S). 
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[0007] £tz*$tw\*mti?-m%immKOD 1 a*© 

D N A# U * y — fe? £ n - h"f -smgl £ tut D N A S:^ 

^ * - icif a u £ d n am ausi^ ^ * - ^xm 
[0008] ^mmitmn^mmKOD i **odn 

-K^AL£DNAi|ia&;US51^*-£ffl^TJ&«e 

d n a# u * 7— tf ^susnr-s c t zftmt-rzmft 

f&i&MWKOD 1 ft*0DNA*U Ay — tf©SiSfe-C 
[0 0 0 9] $£#fgHJ{ij@ffgi&gCKKOD i 

UpH 

mmmmm. 

DNACGC^I 

[ooi i] mttf-mt&MmKOD gf*5i/im 

M^WttW^SP f uDNA/^U *y— t?£KM(Pyroc 
occus furiosus) &<fctfT 1 i (Vent) DNA^'J 
^ 7 — tf £g£fi (Thermococcus 1 i t oral i s) t <DM&ffl& 

[0012] *«^©BJi!MtDNA#ij^7— trae^ 

WUmt. P f uDNA*U/7-tfOffiSI«7 5 
y&ffi^J (Nucleic Acids Research, 1993, vol.21, No. 
2, 259-265) KS^*, T^-f -?-£^§rU £f£T 

[0 0 13] i-rSSf^g»KOD lt*CDSfefe#DN 

t5) fcfflHTPCRKJ&Srfrl^ DN A#rtf-£ifi@£ 
■li-S. it"l@£nAci9rJt©DNAE?iJ («. E?J#^6) 
£&5£U S^SUtTS/gEE^a-b'LT^S 

>^*js-r-5o §wtti.DNA^u^7- t?»e^£ 

^trfrtf-cofcis $ 4 ~ 7 K b p idS^f 

[0 0 14] 5fC> &4~7 Kb p©DNA»rJt£yjP 
^e>®>RU ZinSrJB^T. ^SlgKTDNA^-f Zfy 
'J-£f£SJU ±IBIEKOPCRiii|sDNA»r)t (ffl. 
E?iJ#^6) J^o-^CJor-Myj^ 

[0 0 15] *^ld45UT^D->-fbUfcKOD 1* 



D N A # >J ^ 7— fef £ 3 - K T -5 * IK $ tl D N A £ ^ 
»ESISnfca«;Ut±IIBl&*«*U (a) I£*&&;L 

(b) ffi&;U§±ifflli§fi*©^*e£a«£!$ST5:ne 
£^fr£££1#gi£r3j@#&teJ!CMKOD 1 e£*DN 
A# U ;* y— b'^S3-r 5£ftT&£o 

[0 0 10] *^BJICi3^TfiefflT^SiiFl^telIS« 1 

*• 
S*« 

6 5~1 0 0"C 
9 5"C 
5~9 
6 

2~3% 
3 8% 

ODNA#'J^7-tflgftt5 0 10II (jtST5 
/K1 6 7 0M) nT 1^-5 (E^iJS^-1) . 

fl&©DNA#U*5 — VtikgiLtztZZ. *^B^CDit 
fifC 5«DNA#U/7- 

Reg.ionl~5i!|5ffaiTH^ SfcflHte 
^ON*SfiUC3* -5' X* 7*^1/7--^^ -7 
TWEXOl, 2, 3«5#fiU!^ ; 

KODitt**©iK^ttDNA^u fc*jf£T©& 

#««c. R e g i o n 1 , 2 flKtt. &*;frffiE?iJ## 
ffiUT430. ^O^"— 7 p >U-x^f yf 71/-A (OR 
F) CO^^nfc^TOfid^T^-So 
[0 0 16] SffSifeHlKOD l«tOBjSittDNA^ 

U ^"-y-X (Pyrococcus furiosus) 4*OP f uDNA^ 
U^7-ifl£f 5-328969 ^$8) . RtfV 

—^ziyfsTs • ') b y U X (Thermococcus litoralis)S 
*OT1 i (Ve n t) DNA#U*7— fcfite^ (# 
M¥ 6-7160 ^£$8) tit^T-5t. *^BJ©KOD 1 
^ae^tc^ffiEJFiJAt^ffiT'i)^ ±KP f uDN 
A# 'J ^ 7— ri©)ie^lctt^ttE5iJ«#ffi-li-T. * & 
Ti i DNA#U>7- tfae^icti. 2 fflorfrftEJiJ 

Region2, 3 CDftTS 0 . *^BJ»KOD 1 #£CD 
BUftttD N A# U ^ 7 — tfite^ rtO^T«EE^JcD#«E© 
^ft«7C€<S^t3T^'5 (B4#iB) . 

[0 0 17] #5^©SeTtei@0&&I§CMKOD 1 * 
*©DNA#U ^^-1f5-3- KT5DNATS5. g 
DNAO-«liI?!IS§l $£tt2 t|2m$n-575/ 
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OD 1#&*©»£M4DNA#'J*7-- tr*^H»T^ 
3E£-tf-5fc«£>. E^JS^lK^txS&SEjajWl 3 7 
4~2 4 5 3 bp. 2708~4316bp CD;fr£E?>J 

a#u^7— eae^tjwuis-r*. ^e 

pints'? d- >{t ufciie^* 3 a©y 5-f ^-©.m 

$n^3WM-srii'is-rso i^tiffln^^fv-j 

fflt»T-€-0*S|oa*-r*E5«lt*Jffl l/T P C RKJSS: 
fflt»ra»KPCREi6*ffV». ^«EBJI6**&l»KO 

[0 0 18] *ISil:l5^T«ffit5'?J'i'-H ,KO 
D 1 ft5fc©KgH4DNA#U V<D9 U—~><f& 

«k<. «*.B7r-5?*s«ktf:/9;*s en*. 

fllttT^XS F^**-, W*.«pE-T-8cfcE*ije 
tf*dt*«T*<&. tfc»Jft^5X3 h*©#J£:LTtt, 
pUCl 9, pBR3 2 2. pBluescript. 
PSP73, pGW7, p E T 3 A, pETllC&t* 
j&<&-5. 7 7-ytL-m fctMAgtll, AD 
ASH, AZapI I&&J»*tf&n*. #$6ipJI;:*3H 

en-§>„ icif turn enAtfj m 1 0 9, 101, 

XLl, PR1, BL2 1 (DE3) plysSftt"^ 
mi <btlZ>. *^^T«±a2KOD 1 S3fe©W!»H4DN 
A # • J ;* 7 — bf * 3 — K T -5 £ ± IB ^ ? 9 — \ Z if 

[0 0 19] *56W©«i!tffiTW:. ±Eaai*.1f±IM& 
***LT, KODlHeS*©Bf»ttDNA#y*7- 

■trae j fsB*«ss-&*. mmz-mummommizm 
«, KODi«a*o^flEia5«ie^sftn^!4»©D 

t?»fi^£-£typET-8 c^7XS H 
yyohf;*^- /3-D- #7? h->HOtSAKcJ:0fTa:^ 

[0020] *%mommmT\z. urnxm^mmzm 
nut*. <a) tttt^.ia±iua«iite>&«. swu 

ttfl&ttffltt&HSU (b) ?S±*ffl8Sli3*©;F*EMei« 



-5. ttBtteiSffttl::f?nfllLft:». lg#i£$aS. ^-f 
/ 5.A ■ 7U>?7UXmiz£Qm# : $:%LVi'-r : b. 
■T. JIM&l&ffcBU ±AI«tOBJ!!HtDNA3j5U>t5- 

[0 p.2.1]-C©«CLTW#Lfca»*RttJRBKOD 
l*a*OB«HtD-NA#.U^5— t?©^*tt. *$9 
0KDaT*« (H2#M> . 

[0 0 2 2] Sfc. iOB«MSDNAjR«J^7— tfSffl 

A*rM-©WBaHKBSn* (0 3#i) . 
[0 0 2 3] 

[0 0 2 4] 

[*M] %.\z*mw&mmm*m^Tmwtz>. 
mmmi 

9 5tl;T«S^ B^SrHIiRbfCo »5*l&B#a»S 
*i*fcfl^MB#S*te«MKOD l#©*fittDNA*W 
SSLfc. Pyrococcus furiosus S3fcODNA/l?U ^7 — 
If (Pfu#'J^5-W ©«£Ffi5*7SyBE*H::* 
2107*7^7- (5' -GCATTAGTATAGTGCCAATGGAA 
GGCGAC-3' (EJiJS-^ 4 ) , 5' -GAGGGCGAAGTTTATTCCGAGCTT 
-3" (E*HMI5) £-&i&Lfc. ^ffl2iffl7*7'fV-5: 
ftfflU B»bfcftfefrDNASJ»S!tLT, PCRS 

[0 0 2 5] PCRiiifsDNAI&rM-W^aE^J (E^J# 
*f 6) *ft£U 75/&EJIJ (E?»J#^7) 

£©«MBDNAKfi-fc:/D-7a:LT. KOD 1 

mm^DNAmmmmmmmmzM uxtttf > /w ^ 

■f*mW<D J *'1X&&ib1Z (D4-7Kbp) . S6 
K. ^0**S©DNAWrM-*7*n— ^y;W»6®« 
U ^7X5HpBS Uh7^y->ttS) KJfA 
U Cti&©«£#HcJ:B*BB<E.coli JM109)£JgR 
£&LT. 7^7"7'J-^SW:. 1hf>M^'jy 
■f-tf— >3>CfltfflL&^P-y (E?iJ#^6) fcfflu 
T, ^uy-f-if— >a >£fri^ ±E9-f 
KOD l»*3lEODNA#U ^7— tfjg 
e^***-r*t*A6n*^a— >«c(E.coli JM109/ 
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pBSKODl)£ff#L£„ 

[0026] mmm 2 

nfomi-cm%Vtz?a->&i. E.coli JM1 09/pBSKODl 
iO^^SK. BSK0D1£[hHKU S?£t;:fiE^:K«E?iJ 

^£7 = /I?E?iJ£}i5eL£, KODlftfi*©DNA 
^'J/^-flfifliSO 1 OlS^bSO, 167 0 

[0027] nmm 3 

ffiffi^iJSB^ (1 3 7 4~2 4 5 3 b p, 2 7 0 8-43 
16 bp) ^PCRffi!^fe{cJ;i95c0^^fco PCRHS 
£ffiTte. ^D->»«fc 0 SIR Lfc 7^X2 HtrMS 
fc. 3a«y7-fv- (E?«#^8~13) 
*T. #*PCR£frV>. 7>ftE5>J£Bfctr»fc3«H-£«l 

W:EcoRV, C*ifl:BamHI) A»||ia4ilft 
kolzmmhtz. *l»T, PCRlt^M-^. *gjg±* 

it. nmzm&±, ^^z&mr^mh-t. c*sb«k 

fi«r«WM-*S'&b. PCR£#*©»rK-£:7"7-r v 

flH>T»aPCR£fT^. fl-ffiE^WBtOI&JWU N* 
Sl:EcoRV, C^vglCB amH I -tM hSrWTSK 
OD lttfi*0DNA*U/7- t?*n— 
fOiefSrfrSlUbfc. IK, l^jggT£T 7 7*n 
^-*-T8§#i'J1ifcfESI'^$'— , pET-8c(ON 
coI/BamHIt-fh. SfcKjfflfcBUfcSiJIEiiasIM 

- (pET-po 1) 

[0 0 2 8] *SS0iJ 4 
KOD 1 ajfeDNA^'J^ 9— giOfgg^Wg 
ISItfl3TB*Ufc!ia«**«rC**- (pET-po 
1) Zm^T±mW (E.coli JH109)€»1ME&U #£> 
tlfcflfiMJTBW (Molecular Cloning, p. A. 2, 

1989tB«) T**U »HneframJHT7yp ; &- 

^-owsMaat-f yyntr*^- /3-d- #7*h->K 

•CfcT2 0«flMI8U ?S±IBI&**©^«^>/t^H 
SWftLfc. **H#*ato#«LTl*£U KOD 
1 {*&*0>B!IM£DN A# U ;* 7— tf 
[0 0 2 9] 5 



KOD 1 ffl3fcBHMtDNA#'J*7— gflPWB 
&K0!l4'?&£nfcKOD 1 S*Wi!M£DNA#'J*5 

- <Z)#T»£ S D S - P A G E m \Z «fc o TXsbtz <h d 
5. til 8 6-9 2 kDaT^^fe (H2) . 
W4T»fcKOD 1 fi*OlMDNA/i5U^7- tft 

VMavmrn • v-srffl^TPCR^igLrctc 

•5. it— ^37^)7, • U K 7 U X (Thermococcus litora 
lis) fi5feO»»i4DNA#'J/7— g*JflV>*:*££ 
B«i:I65iT5DNARfr#iBSft (03) . Kt> 
!RStttDNA#'J ^7-KIStt«»*BStifc. 
[0 0 3 0] JtttM 1 

*gwoa»^BatKOD i aigMti-pfesfcHton 

3fcTa3-y7JX • "7 'J a-frX (Pyrococcus furiosus) £ 
fcte-tr— ^J-yftX • U H 7 U X (Thermococcus litora 
lis) fl3feOB*MtDNAtfUj*7--«tg ; P£CPjt<g 
**9i©iH#J(MMKSKOD 1 **©DNA#U*7- 

■ttte^p. (mmmn 3 ) , eD3!/^-7Wx(p 

yrococcus furiosus) fi3f£©If#JH4DNA# , J ^ 7— tf 
0f#lffl¥ 5-328969 ^4i*) . It-W?*^ • 
U h^UX (Thermococcus litoralis) fi3fe©Wfgtf4D 
NA#'J^7- trite? (¥fffl¥ 6-7160 *f&$8) OD 
NAEaWSTSyBtEJilSltJeU tt(8ttltb&. * 
^CKODlS*fflDNA#'J^7-tU n&±®) 
ST&SaDNAjtfU;<7-i?©##S«T»SRe g 
i onl~5*i#4bTW; t £ fcNsfciMlKH: 3 1 - 
5' X+V^^l/7-1f : &f-7Tib4EX01, 2, 
3i>^tELX^tz. LfrL* aDNA#'J;*7— 
ffiJSR egionliRegion 2<D[*l\Z\*. 
flEEJUVS-A. IVS-B^#4lTHfe (04# 
SO . —73. ea^7*X • 7 'J (Pyrococcus fu 

riosus) S*OB«HtDNA#U^7— tfT»*P f u 
#»J * 7--tffctt7>ffiE?WU¥ttL&a*:?fc. 
^37*^ • U h^'JX (Thermococcus litoralis) & 
^OiitDNA/K'J/^-fTi^Ven t#'J;*7 
— t?T«. (iDNA#U^7-ifSSS«Region 
2tRegion 3 ©1*1!;:. 3>&E?iJ IVSltIVS 
2^|gJ?>e>nfc (H4»I) . 
[0 0 3 1] 
[E?iJ*] 
E^iJS^- 1 
E?>J©ft£ : 5342 
E^JOS : mm. (DNA) 
ii©& : 2*IB 
hn#S>- : Klgtt 
E5IJOSS : c DNA 

flc« : KOD 1 
E«o»« 

156-5165 P CDS 
1374-2453 SttESM 
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2708-4316 itftMm 

mm 

GCTTGAGGGC CTGCGGTTAT GGGACGTTGC AGTTTGCGCC TACTCAAAGA TGCCGGTTTT 60 
ATAACGGAGA AAAATGGGGA GCTATTACGA TCTCTCCTTG ATGTGGGGTT TACAATAAAG 1 20 
CCTGGATTGT TCTACAAGAT TATGGGGGAT GAAAG ATG ATC CTC GAC ACT GAC 173 

Met lie Leu Asp Thr Asp 
1 5 

TAC ATA ACC GAG GAT GGA AAG CCT GTC ATA AGA ATT TTC AAG AAG GAA 221 
Tyr lie Thr GIu Asp Gly Lys Pro Val He Arg He Phe Lys Lys Glu 

10 15 20 

AAC GGC GAG TTT AAG ATT GAG TAC GAC CGG ACT TTT GAA CCC TAC TTC 269 
Asn Gly Glu Phe Lys He Glu Tyr Asp Arg Thr Phe Glu Pro Tyr Phe 

25 30 1 35 

TAC GCC CTC CTG AAG GAC GAT TCT GCC ATT GAG GAA GTC AAG AAG ATA 317 
Tyr Ala Leu Leu Lys Asp Asp Ser Ala lie Glu Glu Val Lys Lys He 

40 45 50 

ACC GCC GAG AGG CAC GGG ACG GTT GTA ACG GTT AAG CGG GTT GAA AAG 365 
Thr Ala Glu Arg His Gly Thr Val Val Thr Val Lys Arg Val Glu Lys 
55 60 65 70 

GTT CAG AAG AAG TTC CTC GGG AGA CCA GTT GAG GTC TGG AAA CTC TAC 413 
Val Gin Lys Lys Phe Leu Gly Arg Pro Val Glu Val Trp Lys Leu Tyr 

75 80 85 

TTT ACT CAT CCG CAG GAC GTC CCA GCG ATA AGG GAC AAG ATA CGA GAG 461 
Phe Thr His Pro Gin Asp Val Pro Ala He Arg Asp Lys He Arg Glu 

90 95 100 

CAT GGA GCA GTT ATT GAC ATC TAC GAG TAC GAC ATA CCC TTC GCC AAG 509 
His Gly Ala Val He Asp He Tyr Glu Tyr Asp He Pro Phe Ala Lys 

105 110 115 

CGC TAC CTC ATA GAC AAG GGA TTA GTG CCA ATG GAA GGC GAC GAG GAG 557 
Arg Tyr Leu He Asp Lys Gly Leu Val Pro Met Glu Gly Asp Glu Glu 

120 125 130 

CTG AAA ATG CTC GCC TTC GAC ATT CAA ACT CTC TAC CAT GAG GGC GAG 605 
Leu Lys Met Leu Ala Phe Asp He Gin Thr Leu Tyr His Glu Gly Glu 
135 140 145 150 

GAG TTC GCC GAG GGG CCA ATC CTT ATG ATA AGC TAC GCC GAC GAG GAA 653 
Glu Phe Ala Glu Gly Pro He Leu Met He Ser Tyr Ala Asp Glu Glu 

155 160 165 

GGG GCC AGG GTG ATA ACT TGG AAG AAC GTG GAT CTC CCC TAC GTT GAC 701 
Gly Ala Arg Val He Thr Trp Lys Asn Vai Asp Leu Pro Tyr Val Asp 

170 175 180 

GTC GTC TCG ACG GAG AGG GAG ATG ATA AAG CGC TTC CTC CGT GTT GTG 749 
Val Val Ser Thr Glu Arg Glu Met He Lys Arg Phe Leu Arg Val Val 

185 190 195 

AAG GAG AAA GAC CCG GAC GTT CTC ATA ACC TAC AAC GGC GAC AAC TTC 797 
Lys Glu Lys Asp Pro Asp Va! Leu He Thr Tyr Asn Gly Asp Asn Phe 

200 205 210 

GAC TTC GCC TAT CTG AAA AAG CGC TGT GAA AAG CTC GGA ATA AAC TTC 845 
Asp Phe Ala Tyr Leu Lys Lys Arg Cys Glu Lys Leu Gly He Asn Phe 
215 220 225 230 

GCC CTC GGA AGG GAT GGA AGC GAG CCG AAG ATT CAG AGG ATG GGC GAC 893 
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Ala Leu 

AGG TTT 
Arg Phe 

GTG ATA 
Val He 

TAT GAA 
Tyr Glu 
280 
ATA ACA 
lie Thr 
295 

TAC TCG 
Tyr Ser 

CTT CCG 
Leu Pro 

GAC GTC 
Asp Val 

AGG-AAG 
Arg Lys 
360 
AAG GAG 
Lys Glu 
375 

GAG CCC 
Glu Pro 

TGC CAT 
Cys His 

AAC ATC 
Asn He 

TGG CAG 
Trp Gin 
440 
GTA AAC 
Val Asn 
455 

GTT ACA 
Val Thr 

TCT TTC 
Ser Phe 



Gly Arg Asp 
235 

GCC GTC GAA 
Ala Val Glu 

250 
AGA CGG ACG 
Arg Arg Thr 
265 

GCC GTC TTC 
Ala Val Phe 

CCA GCC TGG 
Pro Ala Trp 

ATG GAA GAT 
Met Glu Asp 
315 

ATG GAG GCC 
Met Glu Ala 
330 

TCC CGC TCC- 
Ser Arg Ser 
345 

GCC TAT GAG 
Ala Tyr Glu 

CTG GCC AGA 
Leu Ala Arg 

GAG AGA GGG 
Glu Arg Gly 
395 

CCA GCC GAT 
Pro Ala Asp 

410 
AGC GAG GTT 
Ser Glu Val 
425 

AGA GTT AGA 
Arg Val Arg 

ATA AAC GGG 
He Asn Gly 

AAG AAC GAA 
Lys Asn Glu 
475 

CTT ACT AAA 
Leu Thr Lys 
490 



Gly Ser Glu Pro Lys He Gin Arg Met 
240 

GTG AAG GGA CGG ATA CAC TTC 
Val Lys Gly Arg He His Phe 
255 

CCC ACA TAC ACG 
Pro Thr Tyr Thr 



ATA AAC CTG 
He Asn Leu 
270 

GGT CAG CCG 
Gly Gin Pro 

285 
GAA ACC GGC 
Glu Thr Gly 
300 

GCG AAG GTC 
Ala Lys Val 

CAG CTT TCT 
Gin Leu Ser 

AGC ACT GGC 
Ser Thr Gly 
350 

AGG AAT GAG 
Arg Asn Glu 

365 
AGA CGG CAG 
Arg Arg Gin 
380 

TTG TGG GAG 
Leu Trp Glu 

ACG AAG GTT 
Thr Lys Val 

CAG GAA GGT 
Gin Glu Gly 
430 

AAA GTA TGG 
Lys Val Trp 

445 
TTA AAG TGT 
Leu Lys Cys 
460 

CGA CAA ACG 
Arg Gin Thr 

AAA GTT AAG 
Lys Val Lys 



AAG GAG 
Lys Glu 

GAG AAC 
Glu Asn 

ACA TAC 
Thr Tyr 
320 
CGC TTA 
Arg Leu 
335 

AAC CTC 
Asn Leu 

CTG GCC 
Leu Ala 

AGC TAT 
Ser Tyr 

AAC ATA 

Asn lie 
400 

GTC GTC 

Val Val 
415 

GAC TAT 

Asp Tyr 

GAA TAC 
Glu Tyr 

ACG CCC 
Thr Pro 

AGA ATA 
Arg He 
480 
GGC AAG 
Gly Lys 
495 



AAG GTT 
Lys Val 
290 
CTT GAG 
Leu Glu 
305 

GAG CTT 
Glu Leu 

ATC GGC 
He Gly 

GTT GAG 
Val Glu 

CCG AAC 
Pro Asn 
370 
GAA GGA 
Glu Gly 
385 

GTG TAC 
Val Tyr 

AAG GGG 
Lys Gly 

GTC CTT 
Val Leu 

GAC TAC 
Asp Tyr 
450 
AAT CAT 
Asn His 
465 

AGA GAC 
Arg Asp 

ATA ATA 
He He 



GAT CTC 
Asp Leu 
260 
CTT GAG 
Leu Glu 
275 

TAC GCT 
Tyr Ala 

AGA GTC 
Arg Val 

GGG AAG 
Gly Lys 

CAG TCC 
Gin Ser 
340 
TGG TTC 
Trp Phe 
355 

AAG CCC 
Lys Pro 

GGC TAT 
Gly Tyr 

CTA GAT 
Leu Asp 

AAG GGG 
Lys Gly 
420 
GGG ATT 
Gly He 
435 

AAA GGG 
Lys Gly 



Gly Asp 
245 

TAT CCT 
Tyr Pro 

GCC GTT 
Ala Val 

GAG GAA 
Glu Glu 

GCC CGC 

Ala Arg 
310 

GAG TTC 

Glu Phe 
325 

CTC TGG 

Leu Trp 

CTC CTC 
Leu Leu 

GAT GAA 
Asp Glu 

GTA AAA 
Val Lys 
390 
TTT AGA 
Phe Arg 
405 

ATT ATA 
He lie 

GAC GGC 
Asp Gly 

GAG CTT 
Glu Leu 



AAG CTT CCC GTT 
Lys Leu Pro Val 
470 

AGT CTT GCT AAG 
Ser Leu Ala Lys 
485 

ACC ACT CCC CTT 
Thr Thr Pro Leu 
500 



941 



989 



1037 



1085 



1133 



1181 



1229 



1277 



1325 



1373 



1421 



1469 



1517 



1565 



1613 



1661 
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TTC TAT 
Phe Tyr 

GTT CTC 
Val Leu 
520 
TTG AGG 
Leu Arg 
535 

ATT TCA 
He Ser 

GAG TTT 
Glu Phe 

ACT CCA 
Thr Pro 

GTT GCG 
Val Ala 
600 
ATA GAG 
He Giu 
615 

GAC GGT 
Asp Gly 

ACA AAG 
Thr Lys 

CTT GGT 
Leu Gly 

AAA GAT 
Lys Asp 
680 
ATA CTG 
He Leu 
695 

CTG CTT 
Leu Leu 

AAT GGA 
Asn Gly 

GAG GGC 
Glu Gly 

GCC AAT 
Ala Asn 



GAA ATA 
Glu He 
505 

AAG GGA 
Lys Gly 

AAA GAC 
Lys Asp 

CAC CAG 
His Gin 

AGG GAT 
Arg Asp 
570 
AGC ATC 
Ser He 
585 

AAG AAG 
Lys Lys 

TCC CTA 
Ser Leu 

TCA GTA 
Ser Val 

AAC GAG 
Asn Glu 
650 
ATC CCT 
He Pro 
665 

CGG AGC 
Arg Ser 

TTC CAA 
Phe Gin 

AAT AAG 
Asn Lys 

TTT TAC 
Phe Tyr 
730 
AAG GTC 
Lys Val 
745 

GGC ATA 
Gly lie 



GGC AGA GCG 
Gly Arg Ala 

GAG CTC GCT 
Glu Leu Ala 

525 

GTT GAA TAC 
Val Glu Tyr 

540 
TAT CGT GTT 
Tyr Arg Val 
555 

CGT ATC ACA 
Arg He Thr 

TCG GAG AAG 
Ser Glu Lys 

AAT GTT TAT 
Asn Val Tyr 
605 

CAT GCC CCC 
His Ala Pro 

620 
AAC AGG GTT 
Asn Arg Val 
635 

TGG AAG ATT 
Trp Lys lie 

CAT CAA ACG 
His Gin Thr 

AGG TAT ATA 
Arg Tyr He 
685 

ACA CTC ATT 
Thr Leu He 

700 
GCA ATA TCT 
Ala He Ser 
715 

AGG CTC AGT 
Arg Leu Ser 

TAT GAC TTA 
Tyr Asp Leu 

TTG ACC CAT 
Leu Thr His 



ACA AGT GAG 
Thr Ser Glu 
510 

GGC ATA CTA 
Gly He Leu 

TTT GAT TCA 
Phe Asp Ser 

GAG ATA ACC 
Glu He Thr 
560 

TAC ATT TTT 
Tyr He Phe 

575* 
AAA GGA ACT 
Lys Gly Thr 
590 

CTT AAA GTC 
Leu Lys Val 

TCG GTT CTC 
Ser Val Leu 

AGG AGG AGT 
Arg Arg Ser 
640 

AAA CTG GTG 
Lys Leu Val 

655 
TAC ACG TAT 
Tyr Thr Tyr 
670 

CTG GAG ATA 
Leu Glu He 

GGA TTC ATC 
Gly Phe He 

CAG AGG GAA 
Gin Arg Glu 
720 

GAA TTC AAT 
Glu Phe Asn 

735 
ACT CTT GAA 
Thr Leu Glu 
750 

AAC TCC CTG 
Asn Ser Leu 



AAT ATT 
Asn He 

TTG GCT 
Leu Ala 
530 
TCC CGC 
Ser Arg 
545 

ATT GGG 
He Gly 

GAG CGT 
Glu Arg 

AAC GCA 
Asn Ala 

AAG GAA 
Lys Glu 
610 
AGG GGA 
Arg Gly 
625 

ATT GTT 
He Val 

TCA AAA 
Ser Lys 

CAG TAT 
Gin Tyr 

ACT GGA 
Thr Gly 
690 
AGT GAA 
Ser Glu 
705 

ATG AAC 
Met Asn 

GTC AGC 
Val Ser 

GGA ACT 
Gly Thr 

TAC CCC 
Tyr Pro 



CCA GAA GAA GAG 
Pro Glu Glu Glu 
515 

GAA GGA ACG CTC 
Glu Gly Thr Leu 



AAA AAA 
Lys Lys 

AAA GAC 
Lys Asp 

TTG TTT 
Leu Phe 
580 
GTA ACA 
Val Thr 
595 

ATT ATG 
He Met 

TTC TTC 
Phe Phe 

GCA ACC 
Ala Thr 

CTG CTC 
Leu Leu 
660 
CAG GAA 
Gin Glu 
675 

AAG GAC 
Lys Asp 

AGA AAG 
Arg Lys 

AAC TTG 
Asn Leu 

ACG GAA 
Thr Glu 
740 
CCC TAC 
Pro Tyr 
755 

TCA ATC 
Ser He 



CGG AGG 
Arg Arg 
550 
GAG GAG 
Glu Glu 
565 

GGG ATT 
Gly He 

CTC AAA 
Leu Lys 

GAC AAC 
Asp Asn 

GAA GGC 
Glu Gly 
630 
CAG GGT 
Gin Gly 
645 

TCC CAG 
Ser Gin 

AAT GGG 
Asn Gly 

GGA TTG 
Gly Leu 

AAC GCT 
Asn Ala 
710 
GAA AAC 
Glu Asn 
725 

TAC TAT 
Tyr Tyr 

TAC TTT 
Tyr Phe 

ATC ATC 
He He 



1709 



1757 



1805 



1853 



1901 



1949 



1997 



2045 



2093 



2141 



2189 



2237 



2285 



2333 



2381 



2429 



2477 
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760 

ACC CAC AAC GTC 
Thr His Asn Val 
775 

TAT GAC GTT GCC 
Tyr Asp Val Ala 



GGA TTT 
Gly Phe 

ATA AAG 
lie Lys 

CTC GAT 
Leu Asp 
840 
CCC GAG 
Pro Glu 
855 

AGG ATT 
Arg He 

GTA AAG 
Val Lys 

GTC CCG 
Val Pro 

AAG GCC 
Lys Ala 
920 
CTG AAG 
Leu Lys 
935 

TCT GTG 
Ser Val 



ATC CCG 
He Pro 
810 
AAG AAG 
Lys Lys 
825 

TAC AGG 
Tyr Arg 

GAA TGG 
Glu Trp 

GGA GAG 
Gly Glu 

AGA GAG 
Arg Glu 
890 
TCC TTT 
Ser Phe 
905 

CTG ATT 
Leu lie 

TCG GGG 
Ser Gly 

AGA AAC 
Arg Asn 



765 

TCG CCG GAT ACG 
Ser Pro Asp Thr 
780 

CCA CAG GTC GGC 
Pro Gin Val Gly 

795 - 

AGC CTG CTT GGA 
Ser Leu Leu Gly 



CCA GGT GAC CTC 
Pro Gly Asp Leu 
970 

AAC CAC GTG CTG 
Asn His Val Leu 
985 

GAA ACT TTG GAC 
Glu Thr Leu Asp 
1000 

TTC TTT AAA GGG 
Phe Phe Lys Gly 
1015 

AAG AGG CCC AGA 



ATG AAG 
Met Lys 

CAG AGG 
Gin Arg 

CTT CCA 
Leu Pro 
860 
CTC ATA 
Leu He 
875 

GGC GAG 
Gly Glu 

AAC AGG 
Asn Arg 

AGG CAC 
Arg His 

AGG AGA 
Arg Arg 
940 
GGG GAG 
Gly Glu 
955 

GTT GCA 
Val Ala 



GCC ACG 
Ala Thr 
830 
GCC ATC 
Ala He 
845 

GTC CTC 
Val Leu 

GAC CGG 
Asp Arg 

ACG GAA 
Thr Glu 

AGA ACT 
Arg Thr 
910 
GAT TAT 
Asp Tyr 
925 

ATA AAG 
lie Lys 

CTC GTT 
Leu Val 

GTC CCG 
Val Pro 



AAC CTC GTT GAA 
Asn Leu Val Glu 
990 

ATC GTC ATG ACG 
He Val Met Thr 
1005 

ATG CTC AGG ACT 
Met Leu Arg Thr 
1020 

ACC GCG AGA CGC 



CTC AAC 
Leu Asn 

CAC CGC 
His Arg 
800 
GAC CTC 
Asp Leu 
815 

ATT GAC 
He Asp 

AAG ATC 
Lys He 

GAG GAA 
Glu Glu 

ATG ATG 
Met Met 
880 
GTG CTT 
Val Leu 
895 

AAC AAG 
Asn Lys 

TCT GGC 
Ser Gly 

ATA ACC 
He Thr 

GAA GTT 
Glu Val 
960 
CGG AGA 
Arg Arg 
975 

CTG CTC 
Leu Leu 

ATC CCA 
He Pro 

TTG CGC 
Leu Arg 

TAT CTC 



770 

AGA GAA GGA 

Arg Glu Gly 
785 

TTC TGC AAG 

Phe Cys Lys 

CTA GAG GAG 
Leu Glu Glu 

CCG ATC GAG 
Pro He Glu 
835 

CTG GCA AAC 
Leu Ala Asn 

850 
GGG GAG GTT 
Gly Glu Val 
865 

GAG GAA AAT 
Glu Glu Asn 

GAG GTC AGT 
Glu Val Ser 

GCC GAG CTC 
Ala Glu Leu 
915 

AAG GTC TAC 
Lys Val Tyr 

930 
TCT GGC CAC 
Ser Gly His 
945 

ACG GGC GAT 
Thr Gly Asp 

TTG GAG CTT 
Leu Glu Leu 

CTT GGA ACG 
Leu Gly Thr 
995 

GTC AAG GGT 
Val Lys Gly 

1010 
TGG ATT TTC 
Trp He Phe 
1025 

AGG CAC CTT 



TGC AAG GAA 

Cys Lys Glu 
790 

GAC TTC CCA 

Asp Phe Pro 

805 

AGG CAG AAG 

Arg Gin Lys 
820 

AGG AAG CTC 

Arg Lys Leu 

AGC ATC CTA 
Ser lie Leu 

CAC TTC GTC 

His Phe Val 
870 

GCT GGG AAA 

Ala Gly Lys 

885 

GGG CTT GAA 

Gly Leu Glu 
900 

AAG AGA GTA 

Lys Arg Val 

ACC ATC AGA 
Thr He Arg 

AGC CTC TTC 
Ser Leu Phe 
950 

GAA CTA AAG 
Glu Leu Lys 

965 
CCT GAG AGA 
Pro Glu Arg 
980 

CCA GAA GAA 
Pro Glu Glu 

AAG AAG AAC 
Lys Lys Asn 

GGA GAG GAA 
Gly Glu Glu 

1030 
GAG GAT CTG 



2525 



2573 



2621 



2669 



2717 



2765 



2813 



2861 



2909 



9Qc;7 



3005 



3053 



3101 



3149 



3197 



3245 



3293 
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Lys Arg Pro Arg Thr Ala Arg Arg Tyr Leu Arg His Leu Glu Asp Leu 

1035 1040 1045 

GGC TAT GTC CGG CTT AAG AAG ATC GGC TAC GAA GTC CTC GAC TGG GAC 3341 
Gly Tyr Val Arg Leu Lys Lys He Gly Tyr Glu Val Leu Asp Trp Asp 

1050 1055 1060 

TCA CTT AAG AAC TAC AGA AGG CTC TAC GAG GCG CTT GTC GAG AAC GTC 3389 
Ser Leu Lys Asn Tyr Arg Arg Leu Tyr Glu Ala Leu Val Glu Asn Val 

1065 1070 1075 

AGA TAC AAC GGC AAC AAG AGG GAG TAC CTC GTT GAA TTC AAT TCC ATC 3437 
Arg Tyr Asn Gly Asn Lys Arg Glu Tyr Leu Val Glu Phe Asn Ser He 

1080 1085 1090 

CGG GAT GCA GTT GGC ATA ATG CCC CTA AAA GAG CTG AAG GAG TGG AAG 3485 
Arg Asp Ala Val Gly He Met Pro Leu Lys Glu Leu Lys Glu Trp Lys 
1095 1100 1105 1110 

ATC GGC ACG CTG AAC GGC TTC AGA ATG AGA AAG CTC ATT GAA GTG GAC 3533 
He Gly Thr Leu Asn Gly Phe Arg Met Arg Lys Leu He Glu Val Asp 

1115 1120 1125 

GAG TCG TTA GCA AAG CTC CTC GGC TAC TAC GTG AGC GAG GGC TAT GCA 3581 
Glu Ser Leu Ala Lys Leu Leu Gly Tyr Tyr Val Ser Glu Gly Tyr Ala 

1130 1135 1140 

AGA AAG CAG AGG AAT CCC AAA AAC GGC TGG AGC TAC AGC GTG AAG CTC 3629 
Arg Lys Gin Arg Asn Pro Lys Asn Gly Trp Ser Tyr Ser Val Lys Leu 

1145 1150 1155 

TAC AAC GAA GAC CCT GAA GTG CTG GAC GAT ATG GAG AGA CTC GCC AGC 3677 
Tyr Asn Glu Asp Pro Glu Val Leu Asp Asp Met Glu Arg Leu Ala Ser 

1160 1165 1170 

AGG TTT TTC GGG AAG GTG AGG CGG GGC AGG AAC TAC GTT GAG ATA CCG 3725 
Arg Phe Phe Gly Lys Val Arg Arg Gly Arg Asn Tyr Val Glu He Pro 
1175 1180 1185 1190 

AAG AAG ATC GGC TAC CTG CTC TTT GAG AAC ATG TGC GGT GTC CTA GCG 3773 
Lys Lys He Gly Tyr Leu Leu Phe Glu Asn Met Cys Gly Val Leu Ala 

1195 1200 1205 

GAG AAC AAG AGG ATT CCC GAG TTC GTC TTC ACG TCC CCG AAA GGG GTT 3821 
Glu Asn Lys Arg He Pro Glu Phe Val Phe Thr Ser Pro Lys Gly Val 

1210 1215 1220 

CGG CTG GCC TTC CTT GAG GGG TAC TCA TCG GCG ATG GCG ACG TCC ACC 3869 
Arg Leu Ala Phe Leu Glu Gly Tyr Ser Ser Ala Met Ala Thr Ser Thr 

1225 1230 1235 

GAA CAA GAG ACT CAG GCT CTC AAC GAA AAG CGA GCT TTA GCG AAC CAG 3917 
Glu Gin Glu Thr Gin Ala Leu Asn Glu Lys Arg Ala Leu Ala Asn Gin 

1240 1245 1250 

CTC GTC CTC CTC TTG AAC TCG GTG GGG GTC TCT GCT GTA AAA CTT GGG 3965 
Leu Val Leu Leu Leu Asn Ser Val Gly Val Ser Ala Val Lys Leu Gly 
1255 1260 1265 1270 

CAC GAC AGC GGC GTT TAC AGG GTC TAT ATA AAC GAG GAG CTC CCG TTC 4013 
His Asp Ser Gly Val Tyr Arg Val Tyr lie Asn Glu Glu Leu Pro Phe 

1275 1280 1285 

GTA AAG CTG GAC AAG AAA AAG AAC GCC TAC TAC TCA CAC GTG ATC CCC 4061 
Val Lys Leu Asp Lys Lys Lys Asn Ala Tyr Tyr Ser His Val He Pro 
1290 1295 1300 
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AAG GAA GTC CTG AGC GAG GTC TTT GGG AAG GTT TTC CAG AAA AAC GTC 4109 

Lys Glu Val Leu Ser Glu Val' Phe Gly Lys Val Phe Gin Lys Asn Val 

1305 1310 1315 

ACT CCT CAG ACC TTC AGG AAG ATG GTC GAG GAC GGA AGA CTC GAT CCC 4157 

Ser Pro Gin Thr Phe Arg Lys Met Val Glu Asp Gly Arg Leu Asp Pro 

1320 1325 1330 

GAA AAG GCC CAG AGG CTC TCC TGG CTC ATT GAG GGG GAC GTA GTG CTC 4205 

Glu Lys Ala Gin Arg Leu Ser Trp Leu He Glu Gly Asp Val Val Leu 

1335 1340 1345 1350 

GAC CGC GTT GAG TCC GTT GAT GTG GAA GAC TAC GAT GGT TAT GTC TAT 4253 

Asp Arg Val Glu Ser Val Asp Val Glu Asp Tyr Asp Gly Tyr Val Tyr 

1355 1360 1365 

GAC CTG AGC GTC GAG GAC AAC GAG AAC TTC CTC GTT GGC TTT GGG TTG 4301 

Asp Leu Ser Val Glu Asp Asn Glu Asn Phe Leu Val Gly Phe Gly Leu 

1370 1375- 1380 

GTC TAT GCT CAC AAC AGC TAC TAC GGT TAC TAC GGC TAT GCA AGG GCG 4349 

VaLTyr Ala His Asn Ser Tyr Tyr Gly Tyr Tyr Gly Tyr Ala Arg Ala 

1385 1390 1395 

CGC TGG TAC TGC AAG GAG TGT GCA GAG AGC GTA ACG GCC TGG GGA AGG 4397 

Arg Trp Tyr Cys Lys Glu Cys Ala Glu Ser Val Thr Ala Trp Gly Arg 

1400= ■ 1405 1410 

GAG TAC ATA ACG ATG ACC ATC AAG GAG ATA GAG GAA AAG TAC GGC TTT 4445 

Glu Tyr He Thr Met Thr He Lys Glu He Glu Glu Lys Tyr Gly Phe 

1415 1420 1425 1430 

AAG GTA ATC TAC AGC GAC ACC GAC GGA TTT TTT GCC ACA ATA CCT GGA 4493 

Lys Val He Tyr Ser Asp Thr Asp Gly Phe Phe Ala Thr lie Pro Gly 

1435 1440 1445 

GCC GAT GCT GAA ACC GTC AAA AAG AAG GCT ATG GAG TTC CTC AAC TAT 4541 

Ala Asp Ala Glu Thr Val Lys Lys Lys Ala Met Glu Phe Leu Asn Tyr 

1450 1455 1460 

ATC AAC GCC AAA CTT CCG GGC GCG CTT GAG CTC GAG TAC GAG GGC TTC 4589 

He Asn Ala Lys Leu Pro Gly Ala Leu Glu Leu Glu Tyr Glu Gly Phe 

1465 1470 1475 

TAC AAA CGC GGC TTC TTC GTC ACG AAG AAG AAG TAT GCG GTG ATA GAC 4637 

Tyr Lys Arg Gly Phe Phe Val Thr Lys Lys Lys Tyr Ala Val He Asp 

1480 1485 1490 

GAG GAA GGC AAG ATA ACA ACG CGC GGA CTT GAG ATT GTG AGG CGT GAC 4685 

Glu Glu Gly Lys He Thr Thr Arg Gly Leu Glu He Val Arg Arg Asp 

1495 1500 1505 1510 

TGG AGC GAG ATA GCG AAA GAG ACG CAG GCG AGG GTT CTT GAA GCT TTG 4733 

Trp Ser Glu He Ala Lys Glu Thr Gin Ala Arg Val Leu Glu Ala Leu 

1515 1520 1525 

CTA AAG GAC GGT GAC GTC GAG AAG GCC GTG AGG ATA GTC AAA GAA GTT 4781 

Leu Lys Asp Gly Asp Val Glu Lys Ala Val Arg He Val Lys Glu Val 

1530 1535 1540 

ACC GAA AAG CTG AGC AAG TAC GAG GTT CCG CCG GAG AAG CTG GTG ATC 4829 

Thr Glu Lys Leu Ser Lys Tyr Glu Val Pro Pro Glu Lys Leu Val He 

1545 1550 1555 

CAC GAG CAG ATA ACG AGG GAT TTA AAG GAC TAC AAG GCA ACC GGT CCC 4877 

His Glu Gin He Thr Arg Asp Leu Lys Asp Tyr Lys Ala Thr Gly Pro 
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1560 1565 1570 

CAC GTT GCC GTT GCC AAG AGG TTG GCC GCG AGA GGA GTC AAA ATA CGC 4925 
His Val Ala Val Ala Lys Arg Leu Ala Ala Arg Gly Val Lys lie Arg 
1575 1580 1585 1590 

CCT GGA ACG GTG ATA AGC TAC ATC GTG CTC AAG GGC TCT GGG AGG ATA 4973 
Pro Gly Thr Val lie Ser Tyr He Val Leu Lys Gly Ser Gly Arg He 

1595 1600 1605 

GGC GAC AGG GCG ATA CCG TTC GAC GAG TTC GAC CCG ACG AAG CAC AAG 5021 
Gly Asp Arg, Ala He Pro Phe Asp Glu Phe Asp Pro Thr Lys His Lys 

1610 1615 1620 

TAC GAC GCC GAG TAC TAC ATT GAG AAC CAG GTT CTC CCA GCC GTT GAG 5069 
Tyr Asp Ala Glu Tyr Tyr He Glu Asn Gin Val Leu Pro Ala Val Glu 

1625 1630 1635 

AGA ATT CTG AGA GCC TTC GGT TAC CGC AAG GAA GAC CTG CGC TAC CAG 5117 
Arg He Leu Arg Ala Phe Gly Tyr Arg Lys Glu Asp Leu Arg Tyr Gin 

1640 1645 1650 

AAG ACG AGA CAG GTT GGT TTG AGT GCT-TGG CTG AAG CCG AAG GGA ACT 5165 
Lys Thr Arg Gin Val Gly Leu Ser Ala Trp Leu Lys Pro Lys Gly Thr 
1655 1660 1665 1670 

TGACCTTTCC ATTTGTTTTC CAGCGGATAA CCCTTTAACT TCCCTTTCAA AAACTCCCT 5225 
■ TAGGGAAAGA CCATGAAGAT AGAAATCCGG CGGCGCCCGG TTAAATACGC TAGGATAGA 5285 
GTGAAGCCAG ACGGCAGGGT AGTCGTCACT GCCCCGAGGG TTCAACGTTG AGAAGTT 5342 
[0 0 3 2] E?i]#^2 h#ns?- : EI84£ 

mmoms : 774 mmomm : *>wn 

mm 

Met He Leu Asp Thr Asp Tyr He Thr Glu Asp Gly Lys Pro Val He 

15 10 15 

Arg He Phe Lys Lys Glu Asn Gly Glu Phe Lys He Glu Tyr Asp Arg 

20 25 30 

Thr Phe Glu Pro Tyr Phe Tyr Ala Leu Leu Lys Asp Asp Ser Ala He 

35 40 45 

Glu Glu Val Lys Lys He Thr Ala Glu Arg His Gly Thr Val Val Thr 

50 55 60 

Val Lys Arg Val Glu Lys Val Gin Lys Lys Phe Leu Gly Arg Pro Val 
65 70 75 80 

Glu Val Trp Lys Leu Tyr Phe Thr His Pro Gin Asp Val Pro Ala He 

85 90 95 

Arg Asp Lys He Arg Glu His Gly Ala Val He Asp He Tyr Glu Tyr 

100 105 110 

Asp He Pro Phe Ala Lys Arg Tyr Leu He Asp Lys Gly Leu Val Pro 

115 120 125 

Met Glu Gly Asp Glu Glu Leu Lys Met Leu Ala Phe Asp He Gin Thr 

130 135 140 

Leu Tyr His Glu Gly Glu Glu Phe Ala Glu Gly Pro He Leu Met He 
145 150 155 160 

Ser Tyr Ala Asp Glu Glu Gly Ala Arg Val He Thr Trp Lys Asn Val 

165 170 175 

Asp Leu Pro Tyr Val Asp Val Val Ser Thr Glu Arg Glu Met He Lys 
180 185 190 
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Arg Phe Leu. Arg Val Val Lys Glu Lys Asp Pro Asp Val Leu He Thr 

195 200 205 

Tyr Asd Gly Asp Asn Phe Asp Phe Ala Tyr Leu Lys Lys Arg Cys Glu 

210 215 220 

Lys Leu Gly lie Asn Phe Ala Leu Gly Arg Asp Gly Ser Glu Pro Lys 
225 230 235 240 

He Gin Arg Met Gly Asp Arg Phe Ala Val Glu Val Lys Gly Arg He 

245 250 255 

His Phe Asp Leu Tyr Pro Val He Arg Arg Thr He Asn Leu Pro Thr 

260 265 270 

Tyr Thr Leu Glu Ala Val Tyr Glu Ala Val Phe Gly Gin Pro Lys Glu 

275 280 285 

Lys Val Tyr Ala Glu Glu He Thr Pro Ala Trp Glu Thr Gly Glu Asn 

290 295 300 

Leu Glu Arg Val Ala Arg Tyr Ser Met Glu Asp Ala Lys Val Thr Tyr 
305 310 315 320 

Glu Leu Gly Lys Glu Phe Leu Pro Met Glu Ala Gin Leu Ser Arg Leu 

325 330 335 

He Gly Gin Ser Leu Trp Asp Val Ser Arg Ser Ser Thr Gly Asn Leu 

340 345 350 

Val Glu Trp Phe Leu Leu Arg Lys Ala Tyr Glu Arg Asn Glu Leu Ala 

355 360 365 

Pro Asn Lys Pro Asp Glu Lys Glu Leu Ala Arg Arg Arg Gin Ser Tyr 

370 375 380 

Glu Gly Gly Tyr Val Lys Glu Pro Glu Arg Gly Leu Trp Glu Asn He 
385 390 395 400 

Val Tyr Leu Asp Phe Arg Ser Leu Tyr Pro Ser lie He He Thr His 

405 410 415 

Asn Val Ser Pro Asp Thr Leu Asn Arg Glu Gly Cys Lys Glu Tyr Asp 

420 425 430 

Val Ala Pro Gin Val Gly His Arg Phe Cys Lys Asp Phe Pro Gly Phe 

435 440 445 

He Pro Ser Leu Leu Gly Asp Leu Leu Glu Glu Arg Gin Lys He Lys 

450 455 460 

Lys Lys Met Lys Ala Thr He Asp Pro lie Glu Arg Lys Leu Leu Asp 
465 470 475 480 

Tyr Arg Gin Arg Ala lie Lys He Leu Ala Asn Ser Tyr Tyr Gly Tyr 

485 490 495 

Tyr Gly Tyr Ala Arg Ala Arg Trp Tyr Cys Lys Glu Cys Ala Glu Ser 

500 505 510 

Val Thr Ala Trp Gly Arg Glu Tyr He Thr Met Thr He Lys Glu He 

515 520 525 

Glu Glu Lys Tyr Gly Phe Lys Val He Tyr Ser Asp Thr Asp Gly Phe 

530 535 540 

Phe Ala Thr He Pro Gly Ala Asp Ala Glu Thr Val Lys Lys Lys Ala 
545 550 555 560 

Met Glu Phe Leu Asn Tyr He Asn Ala Lys Leu Pro Gly Ala Leu Glu 

565 570 575 

Leu Glu Tyr Glu Gly Phe Tyr Lys Arg Gly Phe Phe Val Thr Lys Lys 
580 585 590 
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Lys Tyr Ala 5 Val He Asp Glu Glu Gly Lys lie Thr Thr Arg Gly Leu 

595 600 605 

Glu He Val Arg Arg Asp Trp Ser Glu He Ala Lys Glu Thr Gin Ala 

610 615 620 

Arg Val Leu Glu Ala Leu Leu Lys Asp Gly Asp Val Glu Lys Ala Val 
625 630 635 640 

Arg lie Val Lys Glu Val Thr Glu Lys Leu Ser Lys Tyr Glu Val Pro 

645 650 655 

Pro Glu Lys Leu Val He His Glu Gin He. Thr Arg Asp Leu Lys Asp 

660 665 670 

Tyr Lys Ala Thr Gly Pro His Val Ala Val Ala Lys Arg Leu Ala Ala 

675 680 685 

Arg Gly Val Lys He Arg Pro Gly Thr Val lie Ser Tyr He Val Leu 

690 695 700 

Lys Gly Ser Gly Arg He Gly Asp Arg Ala He Pro Phe Asp Glu Phe 
705 710 715: 720 

Asp Pro Thr Lys His Lys Tyr Asp Ala Glu Tyr Tyr He Glu Asn Gin 

725 730 735 

Val Leu Pro Ala Val Glu Arg He Leu Arg Ala Phe Gly Tyr Arg Lys 

740 . 745' 750 

Glu Asp Leu Arg Tyr Gin Lys- Thr Arg, Gin Val Gly Leu Ser Ala Trp 

755 760 765 

Leu Lys Pro Lys Gly Thr - 
770 

[0 0 3 3] mm^3 bD^v-:lfigm 
E^J<7)fi$ : 5342 ffi?UO«S:cDNA 

mmm : mm (dna) mm : m&mt&mm 

fi<2)ft : 2*ft| : KOD 1 

mm 

GCTTGAGGGC CTGCGGTTAT GGGACGTTGC AGTTTGCGCC TACTCAAAGA TGCCGGTTTT 60 
ATAACGGAGA AAAATGGGGA GCTATTACGA TCTCTCCTTG ATGTGGGGTT TACAATAAAG 120 
CCTGGATTGT TCTACAAGAT TATGGGGGAT GAAAGATGAT CCTCGACACT GACTACATAA 180 
CCGAGGATGG AAAGCCTGTC ATAAGAATTT TCAAGAAGGA AAACGGCGAG TTTAAGATTG 240 
AGTACGACCG GACTTTTGAA CCCTACTTCT ACGCCCTCCT GAAGGACGAT TCTGCCATTG 300 
AGGAAGTCAA GAAGATAACC GCCGAGAGGC ACGGGACGGT TGTAACGGTT AAGCGGGTTG 360 
AAAAGGTTCA GAAGAAGTTC CTCGGGAGAC CAGTTGAGGT CTGGAAACTC TACTTTACTC 420 
ATCCGCAGGA CGTCCCAGCG ATAAGGGACA AGATACGAGA GCATGGAGCA GTTATTGACA 480 
TCTACGAGTA CGACATACCC TTCGCCAAGC GCTACCTCAT AGACAAGGGA TTAGTGCCAA 540 
TGGAAGGCGA CGAGGAGCTG AAAATGCTCG CCTTCGACAT TCAAACTCTC TACCATGAGG 600 
GCGAGGAGTT CGCCGAGGGG CCAATCCTTA TGATAAGCTA CGCCGACGAG GAAGGGGCCA 660 
GGGTGATAAC TTGGAAGAAC GTGGATCTCC CCTACGTTGA CGTCGTCTCG ACGGAGAGGG 720 
AGATGATAAA GCGCTTCCTC CGTGTTGTGA AGGAGAAAGA CCCGGACGTT CTCATAACCT 780 
ACAACGGCGA CAACTTCGAC TTCGCCTATC TGAAAAAGCG CTGTGAAAAG CTCGGAATAA 840 
ACTTCGCCCT CGGAAGGGAT GGAAGCGAGC CGAAGATTCA GAGGATGGGC GACAGGTTTG 900 
CCGTCGAAGT GAAGGGACGG ATACACTTCG ATCTCTATCC TGTGATAAGA CGGACGATAA 960 
ACCTGCCCAC ATACACGCTT GAGGCCGTTT ATGAAGCCGT CTTCGGTCAG CCGAAGGAGA 1020 
AGGTTTACGC TGAGGAAATA ACACCAGCCT GGGAAACCGG CGAGAACCTT GAGAGAGTCG 1080 
CCCGCTACTC GATGGAAGAT GCGAAGGTCA CATACGAGCT TGGGAAGGAG TTCCTTCCGA 1140 
TGGAGGCCCA GCTTTCTCGC TTAATCGGCC AGTCCCTCTG GGACGTCTCC CGCTCCAGCA 1200 
CTGGCAACCT CGTTGAGTGG TTCCTCCTCA GGAAGGCCCT ATGAGAGGAA TGAGCTGGCC 1260 
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CCGAACAAGC 


CCGATGAAAA 


GGAGCTGGCC 


AGAAGACGGC 


AGAGCTATGA 


AGGAGGCTAT 


1320 


GTAAAAGAGC 


CCGAGAGAGG 


GTTGTGGGAG 


AACATAGTGT 


ACCTAGATTT 


TAGATGCCAT 


1380 


CCAGCCGATA 


CGAAGGTTGT 


CGTCAAGGGG 


AAGGGGATTA 


TAAACATCAG 


CGAGGTTCAG 


1440 


GAAGGTGACT 


ATGTCCTTGG 


GATTGACGGC 


TGGCAGAGAG 


TTAGAAAAGT 


ATGGGAATAC 


1500 


GACTACAAAG 


GGGAGCTTGT 


AAACATAAAC 


GGGTTAAAGT 


GTACGCCCAA 


TCATAAGCTT 


1560 


CCCGTTGTTA 


CAAAGAACGA 


ACGACAAACG 


AGAATAAGAG 


ACAGTCTTGC 


TAAGTCTTTC 


1620 


CTTACTAAAA 


AAGTTAAGGG 


CAAGATAATA 


ACCACTCCCC 


TTTTCTATGA 


AATAGGCAGA, 1680 


GCGACAAGTG 


AGAATATTCC 


AGAAGAAGAG 


GTT.CTCAACG 


GAGAGCTCGC 


TGGCAT ACTA, 1740 


TTGGCTGAAG 


GAACGCTCTT 


GAGGAAAGAC 


GTTGAATACT 


TTGATTCATC 


CCGCAAAAAA 


1800 


CGGAGGATTT 


CACACCAGTA 


TCGTGTTGAG 


ATAACCATTG 


GGAAAGACGA 


GGAGGAGTTT 


1860 


AGGGATCGTA 


TCACATACAT 


TTTTGAGCGT 


TTGTTTGGGA 


TTACTCCAAG 


CATCTCGGAG 


1920 


AAGAAAGGAA 


CTAACGCAGT 


AACACTCAAA 


GTTGCGAAGA 


AGAATGTTTA 


TCTTAAAGTC 


1980 


AAGGAAATTA 


TGGACAACAT 


AGAGTCCCTA 


CATGCCCCCT 


CGGTTCTCAG 


GGGATTCTTC 


2040 


GAAGGCGACG 


GTTCAGTAAA 


* CAGGTTAGGA 


GGAGTATTGT 


TGCAACCCAG 


GGTACAAAGA 


2100 


ACGAGTGGAA 


GATTAAACTG 


GTGTCAAAAC 


TGCTCTCCCA 


GCTTGGTATC 


CCTCATCAAA 


2160 


CGTACACGTA 


TCAGTATCAG 


GAAAATGGGA 


AAGATCGGAG 


CAGGTATATA 


CTGGAGATAA 


2220 


CTGGAAAGGA 


CGGATTGATA 


. CTGTTCCAAA 


CACTCATTGG 


ATTCATCAGT 


GAAAGAAAGA 


2280 


ACGCTCTGCT 


TAATAAGGCA 


ATATCTCAGA 


GGGAAATGAA 


CAACTTGGAA 


AACAATGGAT 


2340 


TTTACAGGCT 


CAGTGAATTC 


AATGTCAGCA 


CGGAATACTA 


TGAGGGCAAG 


GTCTATGACT 


2400 


TAACTCTTGA 


AGGAACTCCC 


TACTTTGCCA 


ATGGCATATT 


GACCCATAAC 


TCCCTGTACC 


2460 


CCTCAATCAT 


CATCACCCAC 


AACGTCTCGC 


CGGATACGCT 


CAACAGAGAA 


GGATGCAAGG 


2520 


AATATGACGT 


TGCCCCACAG 


GTCGGCCACC 


GCTTCTGCAA 


GGACTTCCCA 


GGATTTATCC 


2580 


CGAGCCTGCT 


TGGAGACCTC 


CTAGAGGAGA 


GGCAGAAGAT 


AAAGAAGAAG 


ATGAAGGCCA 


2640 


CGATTGACCC 


GATCGAGAGG 


AAGCTCCTCG 


ATTACAGGCA 


GAGGGCCATC 


AAGATCCTGG 


2700 


CAAACAGCAT 


CCTACCCGAG 


GAATGGCTTC 


CAGTCCTCGA 


GGAAGGGGAG 


GTTCACTTCG 


2760 


TCAGGATTGG 


AGAGCTCATA 


GACCGGATGA 


TGGAGGAAAA 


TGCTGGGAAA 


GTAAAGAGAG 


2820 


AGGGCGAGAC 


GGAAGTGCTT 


GAGGTCAGTG 


GGCTTGAAGT 


CCCGTCCTTT 


AACAGGAGAA 


2880 


CTAACAAGGC 


CGAGCTCAAG 


AGAGTAAAGG 


CCCTGATTAG 


GCACGATTAT 


TCTGGCAAGG 


2940 


TCTACACCAT 


CAGACTGAAG 


TCGGGGAGGA 


GAATAAAGAT 


AACCTCTGGC 


CACAGCCTCT 


3000 


TCTCTGTGAG 


AAACGGGGAG 


CTCGTTGAAG 


TTACGGGCGA 


TGAACTAAAG 


CCAGGTGACC 


3060 


TCGTTGCAGT 


CCCGCGGAGA 


TTGGAGCTTC 


CTGAGAGAAA 


CCACGTGCTG 


AACCTCGTTG 


3120 


AACTGCTCCT 


TGGAACGCCA 


GAAGAAGAAA 


CTTTGGACAT 


CGTCATGACG 


ATCCCAGTCA 


3180 


AGGGTAAGAA 


GAACTTCTTT 


AAAGGGATGC 


TCAGGACTTT 


GCGCTGGATT 


TTCGGAGAGG 


3240 


AAAAGAGGCC 


CAGAACCGCG 


AGACGCTATC 


TCAGGCACCT 


TGAGGATCTG 


GGCTATGTCC 


3300 


GGCTTAAGAA 


GATCGGCTAC 


GAAGTCCTCG 


ACTGGGACTC 


ACTTAAGAAC 


TACAGAAGGC 


3360 


TCTACGAGGC 


GCTTGTCGAG 


AACGTCAGAT 


ACAACGGCAA 


CAAGAGGGAG 


TACCTCGTTG 


3420 


AATTCAATTC 


CATCCGGGAT 


GCAGTTGGCA 


TAATGCCCCT 


AAAAGAGCTG 


AAGGAGTGGA 


3480 


AGATCGGCAC 


GCTGAACGGC 


TTCAGAATGA 


GAAAGCTCAT 


TGAAGTGGAC 


GAGTCGTTAG 


3540 


CAAAGCTCCT 


CGGCTACTAC 


GTGAGCGAGG 


GCTATGCAAG 


AAAGCAGAGG 


AATCCCAAAA 


3600 


ACGGCTGGAG 


CTACAGCGTG 


AAGCTCTACA 


ACGAAGACCC 


TGAAGTGCTG 


GACGATATGG 


3660 


AGAGACTCGC 


CAGCAGGTTT 


TTCGGGAAGG 


TGAGGCGGGG 


CAGGAACTAC 


GTTGAGATAC 


3720 


CGAAGAAGAT 


CGGCTACCTG 


CTCTTTGAGA 


ACATGTGCGG 


TGTCCTAGCG 


GAGAACAAGA 


3780 


GGATTCCCGA 


GTTCGTCTTC 


ACGTCCCCGA 


AAGGGGTTCG, GCTGGCCTTC 


CTTGAGGGGT 


3840 


ACTCATCGGC 


GATGGCGACG 


TCCACCGAAC 


AAGAGACTCA GGCTCTCAAC 


GAAAAGCGAG 


3900 


CTTTAGCGAA 


CCAGCTCGTC 


CTCCTCTTGA 


ACTCGGTGGG GGTCTCTGCT 


GTAAAACTTG 


3960 


GGCACGACAG 


CGGCGTTTAC 


AGGGTCTATA 


TAAACGAGGA GCTCCCGTTC 


GTAAAGCTGG 


4020 


ACAAGAAAAA 


GAACGCCTAC 


TACTCACACG 


TGATCCCCAA GGAAGTCCTG 


AGCGAGGTCT 


4080 


TTGGGAAGGT 


TTTCCAGAAA 


AACGTCAGTC 


CTCAGACCTT 


CAGGAAGATG 


GTCGAGGACG 


4140 


GAAGACTCGA 


TCCCGAAAAG 


GCCCAGAGGC 


TCTCCTGGCT CATTGAGGGG 


GACGTAGTGC 


4200 


TCGACCGCGT 


TGAGTCCGTT 


GATGTGGAAG 


ACTACGATGG TTATGTCTAT 


GACCTGAGCG 


4260 
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TCGAGGACAA CGAGAACTTC CTCGTTGGCT TTGGGTTGGT CTATGCTCAC AACAGCTACT 4320 

ACGGTTACTA CGGCTATGCA AGGGCGCGCT GGTACTGCAA GGAGTGTGCA GAGAGCGTAA 4380 

CGGCCTGGGG AAGGGAGTAC ATAACGATGA CCATCAAGGA GATAGAGGAA AAGTACGGCT 4440 

TTAAGGTAAT CTACAGCGAC ACCGACGGAT TTTTTGCCAC AATACCTGGA GCCGATGCTG 4500 

AAACCGTCAA AAAGAAGGCT ATGGAGTTCC TCAACTATAT CAACGCCAAA CTTCCGGGCG 4560 

CGCTTGAGCT CGAGTACGAG GGCTTCTACA AACGCGGCTT CTTCGTCACG AAGAAGAAGT 4620 

ATGCGGTGAT AGACGAGGAA GGCAAGATAA CAACGCGCGG ACTTGAGATT GTGAGGCGTG 4680 

ACTGGAGCGA GATAGCGAAA GAGACGCAGG CGAGGGTTCT TGAAGCTTTG CTAAAGGACG 4740 

GTGACGTCGA GAAGGCCGTG AGGATAGTCA AAGAAGTTAC CGAAAAGCTG AGCAAGTACG 4800 

AGGTTCCGCC GGAGAAGCTG GTGATCCACG AGCAGATAAC GAGGGATTTA AAGGACTACA 4860 

AGGCAACCGG TCCCCACGTT GCCGTTGCCA AGAGGTTGGC CGCGAGAGGA GTCAAAATAC 4920 

GCCCTGGAAC GGTGATAAGC TACATCGTGC TCAAGGGCTC TGGGAGGATA GGCGACAGGG 4980 

CGATACCGTT CGACGAGTTC GACCCGACGA AGCACAAGTA CGATGCCGAG TACTACATTG 5040 

AGAACCAGGT TCTCCCAGCC GTTGAGAGAA TTCTGAGAGC CTTCGGTTAC CGCAAGGAAG 5100 

ACCTGCGCTA CCAGAAGACG AGACAGGTTG GTTTGAGTGC TTGGCTGAAG CCGAAGGGAA 5160 

CTTGACCTTT CCATTTGTTT TCCAGCGGAT AACCCTTTAA CTTCCCTTTC AAAAACTCCC 5220 

TTTAGGGAAA GACCATGAAG ATAGAAATCC GGCGGCGCCC GGTTAAATAC GCTAGGATAG 5280 

AAGTGAAGCC AGACGGCAGG GTAGTCGTCA CTGCCCCGAG GGTTCAACGT TGAGAAGTT 5339 
[0 0 3 4] E?iJS^4 h*Dy- : fifitt 
B25UO:ft£ : 24 E^JOffig : MDNA 

be^ijo^ : mm 

GGATTAGTGC CAATGGAAGG CGAC 24 

[0 0 3 5] E^J#^5 h*Py- : dtifltt 

E^J£>ft2 : 24 E^JCOftS : MDNA 

&m<DM : mm 

mm 

GAGGGCGAAG TTTATTCCGA GCTT 24 

[0036] mmn 6 m<v& : 2 #£S 

E?«£>g2 : 324 hD#y- : fifltt 

E^JOS : &St (DNA) MFKDmm : c DNA 

B*l 

GGATTAGTGC CAATGGAAGG CGACGAGGAG CTGAAAATGC TCGCCTTCGA CATTCAAACT 60 
CTCTACCATG AGGGCGAGGA GTTCGCCGAG GGGCCAATCC TTATGATAAG CTACGCCGAC 120 
GAGGAAGGGG CCAGGGTGAT AACTTGGAAG AACGTGGATC TCCCCTACGT TGACGTCGTC 180 
TCGACGGAGA GGGAGATGAT AAAGCGCTTC CTCCGTGTTG TGAAGGAGAA AGACCCGGAC 240 
GTTCTCATAA CCTACAACGG CGACAACTTC GACTTCGCCT ATCTGAAAAA GCGCTGTGAA 300 
AAGCTCGGAA TAAACTTCGC CCTC 324 
[0 0 3 7] i?ij#^7 h#nS?-:tt«tt 

e^noss : 108 &m<Dmm : 9>rt9% 

E^JOS! : 

Gly Leu Val Pro Met Glu Gly Asp Glu Glu Leu Lys Met Leu Ala Phe 

15 10 15 

Asp He Gin Thr Leu Tyr His Glu Gly Glu Glu Phe Ala Glu Gly Pro 

20 25 30 

He Leu Met lie Ser Tyr Ala Asp Glu Glu Gly Ala Arg Val He Thr 

35 40 45 

Trp Lys Asn Val Asp Leu Pro Tyr Val Asp Val Val Ser Thr Glu Arg 
50 55 60 
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Glu Met lie Lys Arg Phe Leu Arg Val Val Lys Glu Lys Asp Pro Asp 
65 70 75 80 

Val Leu He Thr Tyr Asn Gly Asp Asn Phe Asp Phe Ala Tyr Leu Lys 

85 90 95 

Lys Arg Cys Glu Lys Leu Gly He Asn Phe Ala Leu 
100 105 
[0 0 3 8] E?U#^8 l*m 

ejij©:S£ : 42 mmomm ■. ^dna 

&P\<DM : WLf& (DNA) 

ear . 

GCCATCAAGA TCCTGGCAAA CAGCTACTAC GGTTACTACG GC 42 
[0 0 3 9] E?iJ#^9 1#£B 
E?iJ<Z>:fi£ : 32 W.m<DWm : ^DNA 

mm<DW :«S (DNA) 

EH 

GATGGATCCA ACTTCTCAAC GTTGAACCCT CG . 32 

[o o 4 o] mnmm o ta©sst: i*m 

E^JWftS : 46 E^IIOfiiSi : dSEDNA 

w.mow : mm (dna) 

EJ!l 

GAACATAGTG TACCTAGATT TTAGATCCCT GTACCCCTCA ATCATC 46 

[o o 4 i] mnmm 1 . i*m 

E?iJCDfi£ : 42 E£iJ©S3I : MDNA 

mm<om : &® (DNA) 

GCCGTAGTAA CCGTAGTAGC TGTTTGCCAG GATCTTGATG GC 42 

[o o 4 2] mmmm 2 i*g 

SBMt^S^ : 33 E?U©a« : ^f£DNA 
E5«J©S! : ttK (DNA) 
E?>J 

ATCGATATCC TCGACACTGA CTACATAACC GAG 33 

CO 0 4 3] E^J#*»1 3 iOi: 1*0! 

E5"J©ftS : 46 E^iJOaiS : "aJSEDN A 
E3&JOS : #Wt (DNA) 

E3*J 

GATGATTGAG GGGTACAGGG ATCTAAAATC TAGGTACACT ATGTTC 46 

[E9BS0fBfJ*&IR93] [04] SJfiMfKOD 1 fiSfflDNA^U ^7 

mi) m&z.%m'<2?-<Dffi&m&m?o — tf»g^t««»tJBtena Pyrococcus furiosusS 

[02] KOD 1 *5!EiHSSM4DNA#iJ — tffr^ #©»f?*ttD N A# U * 7— tfjte^ii.fcZ/Thermococc 

MI^S^t-tt^&lbO^XT&S. us 1 i toral isS3fe©Bt8M4DN A#'J > 7 — ttfie^i: 

[03] KOD 1 &*ff.«ittDNAtf U*7--tri;:J: OJtgSr^T. 

5pcr <Dm^yn-tmn.^m<D^mx^^o 
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imi) 



RmM&x.^**- <pET-pol)cD#m 

IVS-A ivsu 



EonRVL. 



BamlU 



pcr 



1-xoRV 
rl 



hxoRV 

A 



^ rcu 



J L 



| rcu 



□ G 



Haml II 
h 



HcoRV 
I 



LPCR 



BamlU 
U 



OAT Alt*. 



EcoRV 



l.fcoKI.naralltfftfl: 

HamH* 



AT <-* pni ( AiVS-A, A I VS-U.) 



ligation 



TlamH 
-4n 



Ncol 



T7 Promoter 



1*7 Promoter 




Terminator 



blunt, FlamHI 

Ncol blunt 

HamHI 

Terminator 




-BamlU 



pET-pol 
(AIVS-A, AIVS-B) 
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IM2} 




1 : pET-8c ftm 

2: pET-pol(AlV$-A, AIVS-B) jftSS 

3: pET-8c _L3i^ 

4: pET-8c _h;M* X5 

5: pET-poKMVS-A. AIVS-B) 

6: pET-pol<A|VS-A. AlVS-B) iS* X5 



[03] 




1 : Vent itf U > ^ - -fcf (Themtococcus litoralbi Jft) 

2 s pET-pol(AIVS-A, AIVS-B)±S* 

3 : pET-pol(AlvS-A. AIVS-B>±at* x 5 

4 : pET-8c ±.M^ 

5 : pET-Bc±a* x5 



dna# y > 9 — wfrf-mMte 

(SDS-PAGE&) 



DNA4?y ^ 7— if£fflV^PCR(Poiymerase 
Chain Reaction)*;: <fc #ta£iX*:DNAfcfK* 



[04] 

Erol Exa2 End Rl Rl R2 R2 R4 

i ■ ■ i H^aa mmmmqu 



Pyrococcus furiosuscODNA* \) j ? — t? (Pfu DNA polymerase) 

Eiol Exo2 Enfl Rl B MR5 

RJ 



Thennococcus HtoraIi5<0DNAiK y / 9 — itfe^ (Vent DNA polymerase) 




dna* y > ^-^ae^-oitft 



(SDInt.Cl.6 IKNe*l frrt&iS^t FI &$fiSSfgf5f 

C12R 1:19) 
(C 1 2N 1/21 
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